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Abstract: 


'  It  is  possible  to  represent  a  finite  set  of  points  (atoms)  by  a  finite  sequence  of  points.  However  a 
finite  set  of  points  has  no  distinguished  member  and  therefore  it  is  impossible  to  define  a  function  which  takes 
a  finite  set  of  points  and  returns  a  first  point  in  that  set.  TTius  it  is  impossible  to  represent  a  finite  sequence 
of  points  by  a  finite  set  of  points.  Hie  theory  of  symmetric  sets  provides  a  framework  in  which  this 
observation  about  sets  and  sequences  can  be  proven.  Hie  theory  of  symmetric  sets  is  similar  to  classical 
(Zcrmcllo-Fraenkcl)  set  theory  with  the  exception  that  the  universe  of  symmetric  sets  includes  points 
(ur-clemcnts).  Points  provide  a  basis  for  general  notions  of  isomorphism  and  symmetry.  The  general  notions 
of  isomorphism  and  symmetry  in  turn  provide  a  basis  for  natural,  simple,  and  universal  definitions  of 
abstractness,  essential  properties  and  functions,  canonicality.  and  representations.  It  is  expected  that  these 
notions  will  play  an  important  role  in  the  theory  of  data  struefres  and  in  die  constaiction  of  general 
techniques  for  reasoning  about  data  structures. 
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1.  INTRODUCTION 


Finite  sets  of  points  can  be  represented  by  finite  lists  of  points,  but  it  is  impossible  to  represent  finite 
lists  of  points  with  finite  sets  of  points.  This  is  because  a  set  of  points  has  no  distinguished  member  and 
therefore  it  is  impossible  to  define  a  function  which  maps  a  set  of  points  to  a  "first"  point  in  that  set.  While 
this  simple  observation  seems  straightforward  and  correct,  it  is  very  difficult  to  prove.  What  is  meant  by 
"represent"  or  by  "a  distinguished  member"?  Intuitively  a  list  has  more  structure  or  contains  more 
information  than  a  set.  But  what  does  this  mean?  One  approach  to  defining  the  meaning  of 
"representations",  "distinguished  members"  and  "more  structure"  is  to  study  the  general  nature  of 
mathematical  objects  (such  as  sets  and  sequences).  One  approach  to  the  general  nature  of  mathematical 
objects  is  set  theory.  Symmetric  set  theory  is  a  new  set  theory  which  provides  simple  universal  definitions  of 
the  above  notions. 

Currently  the  most  widely  studied  formal  theory  of  mathematics  as  a  whole  is  Zermello-Fracnkel  set 
theory  (ZF)  and  its  variants.  The  primary  difference  between  ZF  set  theory  and  symmetric  set  theory  involves 
points.  A  point  is  an  object  which  has  no  members  (and  is  thus  not  a  set).  In  ZF  set  theory  there  is  only  one 
point  (the  null  set)  while  symmetric  set  theory  requires  the  existence  of  many  points.  The  following 
discussion  of  this  issue  is  from  the  introduction  to  Foundations  of  Set  Theory  by  Fracnkcl,  Bar-Hillcl,  and 
Levy  [Fraenkcl  ct.  al.  58]  (they  refer  to  points  as  individuals). 

Let  us  refer  to  those  elements  which  have  members  as  sets  and  to  those  elements  which  have  no 
members  as  individuals. ... 

The  existence  of  at  least  one  individual  is  called  for  by  both  philosophical  and  practical  reasons. 

...  Let  us,  however,  stress  that  referring  to  one  of  the  individuals  as  the  null  set  is  done  only  for  reasons 
of  convenience  and  simplicity,  and  can  be  regarded  as  a  mere  notational  convention. 

Having  decided  that  we  need  an  individual  we  now  face  the  question  of  whether  we  need  more 
than  one  individual.  It  turns  out  that  for  mathematical  purposes  there  seems  to  be  no  real  need  for 
individuals  other  than  the  null  set  Therefore  we  shall  not  admit  any  such  individuals  into  ZF. 

While  it  is  true  that  most  mathematics  can  be  done  in  a  framework  where  the  null  set  is  the  only 
point  there  arc  notions  which  are  best  defined  in  a  framework  where  many  points  arc  present.  For  example 
every  mathematical  object  seems  to  have  a  natural  notion  of  isomorphism  associated  with  it.  There  is  a 
natural  notion  of  what  it  means  for  two  Turing  machines,  or  context  free  grammars,  or  topological  spaces  to 
be  isomorphic.  ZF  set  theory  provides  no  satisfactory  general  notion  of  isomorphism.  Hie  terms 
"representation",  "distinguished  element",  and  "more  structure"  arc  very  hard  to  define  in  a  framework 
where  only  one  point  is  present.  However  these  notions  can  be  given  simple  universal  definitions  in  the 
presence  of  many  points. 

Hie  notion  of  isomorphism  can  be  approached  from  two  different  directions.  The  first  is  to  extend 
standard  notions  of  isomorphism  for  particular  types  of  objects  to  a  notion  of  isomorphism  for  arbitrary  sets 
which  are  built  up  out  of  points.  This  type  of  isomorphism  will  be  called  a  structural  isomorphism.  The 
second  approach  to  the  notion  of  isomorphism  is  to  consider  the  symmetries  (automorphisms)  of  a  universe 
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<U  €>.  A  pair  <U  €>  is  a  universe  of  objects  where  U  is  a  universal  domain  and  €  is  a  binary  relation  on  U 
where  xCy  is  read  "x  is  an  element  of  y”.  A  point  of  <U  €>  is  an  element  p  of  U  such  that  there  is  no  x  in  U 
such  that  x€p.  1'he  universe  of  symmetric  sets  has  lots  of  points  and  lots  of  automorphisms.  In  particular 
there  is  a  natural  one  to  one  correspondence  between  the  the  permutations  of  die  points  of  <U  O  and  the 
automorphisms  of  <U  €>  (since  ZF  set  theory  allows  only  one  point  there  are  no  non-trivial  symmetries  of 
<U  O  in  ZF  set  theory).  In  the  theory  of  symmetric  sets  it  can  be  shown  that  two  objects  x  and  y  are 
structurally  isomorphic  just  in  case  there  is  an  automorphism  (a  symmetry)  of  <U  €>  which  maps  x  to  y.  This 
result  justifies  the  intuition  that  isomorphic  objects  arc  indistinguishable. 

Philosophers  of  mathematics  have  observed  that  mathematical  descriptions  of  structures  such  as  the 
natural  numbers  do  not  determine  die  identity  of  those  structures  (Bcnaccrraf  65|.  Hie  best  one  can  hope  to 
do  is  to  determine  identity  "up  to  isomorphism".  'Ihc  theory  of  symmetric  sets  reflects  this  observation.  A 
specification  for  a  particular  object  x  might  be  a  sentence  <t>  such  dial  <J>(y)  holds  just  in  ease  y  is  x.  However 
consider  any  first  order  formula  4>  of  one  free  variable  whose  only  non-logical  symbol  is  €.  If  x  and  y  arc 
isomorphic  objects  then  there  is  a  symmetry  of<U  €>  which  maps  x  to  y  and  tints  «I>< x)  holds  in  <U  €>  just  in 
ease  «h(y)  holds  in  <U  O.  Thus  if  there  arc  several  different  objects  which  are  isomorphic  to  x  then  no  such 
sentence  <I>  can  name  x.  In  the  theory  of  symmetric  sets  there  are  always  many  objects  which  are  isomorphic 
to  x. 

The  theory  of  symmetric  sets  provides  a  simple  and  natural  measure  of  die  "abstractness"  of  objects. 
A  more  abstract  object  is  an  object  with  "less  structure".  A  precise  definition  of  this  notion  can  be 
approached  in  three  different  ways.  First  the  symmetries  of  the  universe  of  symmetric  sets  greatly  restricts  die 
predicates  and  functions  which  can  be  defined  in  terms  of  the  structure  of  <U  O.  A  function  F  will  be  called 
essential  if  it  commutes  with  any  symmetry  (automorphism)  of  <U  €>,  i.e.  for  any  symmetry  p  of  <U  €>  and 
any  element  x  of  U,  p(F(x))  must  equal  F(p(x)).  Given  two  elements  x  and  y  of  U  there  may  not  exist  any 
essential  function  F  which  maps  x  to  y.  For  example  there  is  no  essential  function  w  hich  maps  a  set  of  points 
to  an  element  of  that  set  An  object  y  is  called  an  abstraction  of  an  object  x  just  in  ease  there  is  an  essential 
function  F  which  maps  x  to  y.  This  notion  of  abstraction  can  also  be  approached  by  studying  die  symmetries 
of  particular  objects.  The  symmetry  group  of  an  object  x.  denoted  A(x).  is  the  set  of  all  automorphisms  of 
<L  O  which  leave  x  fixed,  flic  notion  of  an  abstraction  can  also  be  defined  by  saying  dial  y  is  an  abstraction 
of  x  just  in  case  A(y)  contains  A(x).  A  diird  approach  to  die  notion  of  abstraction  is  via  die  notion  of 
contextual  isomorphism  and  die  general  notion  of  a  "canonical"  object.  Two  objects  x  and  y  arc  said  to  be 
isomorphic  in  die  context  of  /.  just  in  ease  there  is  a  symmetry  of  t  which  maps  x  to  y.  For  example  consider  a 
circle  and  two  points  p  and  q  which  arc  in  die  same  plane  as  the  circle.  Hie  points  p  and  q  are  isomorphic  in 
the  context  of  the  circle  just  in  case  they  arc  die  same  distance  from  the  center  of  the  circle.  An  object  y  is  said 
to  be  canonical  in  the  context  of  an  object  i  just  in  case  the  isomorphism  class  of  y  in  the  context  of  /,  |y|r  is  a 
singleton  set  (there  is  no  canonical  point  on  a  circle  or  canonical  corner  on  a  square).  It  turns  out  dial  y  is  an 
abstraction  of  x  just  in  ease  >  is  canonical  in  the  context  of  x. 

Given  the  above  notion  of  abstractness  it  is  possible  to  provide  a  precise  notion  of  "representation". 
For  example  finite  sets  can  lie  represented  as  finite  sequences  because  there  is  an  essential  function  which 
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maps  any  sequence  to  to  the  corresponding  set.  However  it  is  easy  to  show  that  there  is  no  essential  function 
which  maps  finite  sets  of  points  to  finite  sequences  of  points.  Thus  sequences  can  not  be  represented  as  sets. 

Mathematics  is  often  done  in  the  framework  of  some  fixed  but  arbitrary  context.  Intuitively  a 
context  is  a  collection  of  objects  which  arc  taken  to  be  fixed  during  the  course  of  a  mathematical  discussion. 
'Hie  natural  numbers,  the  real  numbers,  and  the  empty  set  arc  all  usually  assumed  to  be  fixed  objects  even 
though  their  "true  identity"  can  not  be  specified.  The  result  relating  essential  functions,  symmetry  groups, 
and  canonical  objects  can  be  generalized  to  account  for  context. 

In  addition  to  the  different  treatment  of  points  there  is  another  less  important  distinction  between 
symmetric  set  theory  and  ZF  set  theory.  The  axioms  of  Z.F  set  theory  are  (an  infinite  number  of)  sentences  of 
first  order  logic  while  the  axioms  of  symmetric  set  theory  arc  precise  conditions  on  the  universe  <U  €>  which 
arc  stated  in  Fnglish  rather  than  first  order  logic.  Thus  symmetric  set  theory  avoids  all  of  the  clumsiness  of 
first  order  logic.  Furthermore  it  is  shown  in  an  appendix  that  a  simple  extension  of  the  axioms  of  sy  mmetric 
set  theory  specify  the  structure  of  <U  €>  up  to  isomorphism,  something  which  could  never  be  done  in  first 
order  logic. 


J  tlk  - 
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2.  THE  AXIOMS  OF  SYMMETRIC  SET  THEORY 


In  deciding  on  axioms  for  a  universe  of  mathematical  objects  there  are  several  considerations.  First 
the  axioms  should  be  as  clear,  simple,  and  natural  as  possible.  Second,  since  7.F  set  theory  is  well  established 
the  axioms  should  not  differ  unnecessarily  from  ZF  set  theory.  Finally,  and  most  interestingly,  the  axioms 
should  provide  a  basis  for  defining  universal  notions  of  isomorphism,  abstraction,  and  representation. 
However  one  should  not  expect  to  immediately  see  how  the  axioms  of  symmetric  set  theory  provide  a  basis 
for  general  notions  of  isomorphism,  abstraction,  and  representation.  These  notions  can  be  defined  only  after 
the  consequences  of  the  simple  set  theoretic  axioms  have  been  investigated. 

No  proof  of  the  consistency  of  the  axioms  of  symmetric  set  theory  is  presented  in  this  section. 
However  it  is  shown  in  an  appendix  that  the  consistency  of  the  axioms  is  equivalent  to  the  existence  of  a 
strongly  inaccessible  cardinal.  The  appendix  also  shows  that  any  universe  satisfying  the  axioms  is  determined 
up  to  isomorphism  by  a  "height"  and  a  "width"  where  the  height  can  be  any  strongly  inaccessible  cardinal 
and  the  width  can  be  any  cardinal  at  least  as  large  as  the  height.  Thus  if  there  arc  strongly  inaccessible 
cardinals  then  there  arc  many  different  (non-isomorphic)  universes  satisfying  the  axioms.  There  is  however  a 
unique  (up  to  isomorphism)  minimal  universe  <U  €>  whose  height  and  width  are  both  tire  least  strongly 
inaccessible  cardinal.  Throughout  the  following  sections  however  the  universe  will  be  taken  to  be  some  fixed 
but  arbitrary  model  of  the  axioms. 

2.1.  The  Nature  of  the  Universe  <U  €> 


The  universe  of  symmetric  sets  is  taken  to  be  a  pair  <U  €>  where  U  is  some  domain  and  €  is  a 
binary  relation  on  U.  Some  elements  of  U  can  be  thought  of  as  sets  in  the  standard  way.  For  example 
consider  the  pair  <U  €>  given  as  follows: 

U  is  the  set  {a  b  c  d  c  1}.  The  relation  €  is  given  by: 

a£d  b€d  c€d 
a€c  b€e 
a€f  c£f 


In  this  situation  a.  b,  and  c  arc  points. 

Definition:  A  point  is  an  element  p  of  U  which  has  no  members,  i.c.  x<f  p  for  all  x  in  U. 

In  the  above  example  the  element  d  represents  the  set  {a be},  c  represents  the  set  {ab},  and  f 
represents  the  set  {ac}.  Note  that  the  relationship  between  d  and  the  set  {a  be}  is  given  by  the  relation  C 
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and  can  not  be  defined  purely  in  terms  of  the  set  U  or  the  element  d.  Not  all  subsets  of  U  need  have 
representations  in  U.  In  the  above  example  there  is  no  representation  for  die  set  {c  f}. 

Definition :  A  subset  C  of  U  is  represented  in  <U  €>  just  in  case  there  is  an  element  z.  of  U 
such  that  x€z  just  in  case  x  is  in  C;  in  this  ease  z  is  called  a  representation  of  C. 

It  is  important  to  note  that  die  notion  of  representadon  expressed  in  die  above  definition  is  not  die 
same  as  die  notion  of  representation  intended  in  die  title  of  this  paper.  A  more  general  notion  of 
representation  will  be  presented  section  four.  However  in  diis  section  the  term  "representation"  will  be  used 
only  in  the  sense  given  in  die  above  definition. 

The  axioms  of  the  dieory  of  symmetric  sets  imply  that  the  universe  U  is  not  empty.  While  a  special 
axiom  to  this  effect  is  not  needed,  the  fact  that  U  is  not  empty  will  be  emphasized  w  ith  an  explicit  axiom. 

Axiom  zero :  U  is  non-empty. 

The  first  axiom  of  symmetric  set  theory  is  that  representations  are  unique. 

Axiom  One.  Fxtensionalily:  Representations  of  non-empty  sets  are  unique,  i.e.  for  any 
non-empty  subset  C  of  U  there  is  at  most  one  element  x  of  U  which  represents  C. 

The  universe  <U  O  should  be  diought  of  as  containing  representations  for  tuples,  functions,  and 
relations  as  well  as  representations  for  subsets  of  U.  For  example  if  x  and  y  arc  elements  of  U  then  z  will  be 
called  a  representation  for  die  pair  <xy>  just  in  ease  z  represents  die  set  {x  {x  y}},  or  more  precisely  z 
represents  a  set  {x  w}  where  w  represents  the  set  {x  y}.  Functions  and  relations  arc  represented  by  sets  of 
tuples  in  die  standard  way  (or  more  precisely  a  function  is  represented  by  a  set  of  elements  of  U  each  of  which 
represents  a  tuple).  Again  the  term  "representation"  is  being  used  here  in  a  different  sense  from  that 
intended  in  the  title  of  the  paper  and  a  more  gcncial  definition  is  given  in  section  four. 

2.2.  The  Comprehension  Axioms 


Given  the  above  axioms  it  is  possible  that  U  contains  only  points,  i.e.  that  no  element  of  U 
represents  a  non-empty  subset  of  U.  Axioms  that  require  that  certain  non-empty  subsets  of  U  be  represented 
in  <U  €>  arc  called  comprehension  axioms.  The  first  comprehension  axiom  makes  use  of  the  following 
definition: 

Definition :  A  subset  C  of  U  will  be  called  small  just  in  case  there  is  some  subset  C’  of  U 
which  is  represented  in  <U  O  such  that  the  cardinality  of  C  is  as  big  as  the  cardinality  of  C. 

A  subset  C  of  U  which  is  not  small  will  be  called  large. 
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By  definition  every  subset  C  of  U  which  is  represented  in  <U  O  is  small.  However  it  has  not  been 
guaranteed  that  every  small  subset  of  U  is  represented  in  <U  €>.  I'his  is  the  first  comprehension  axiom. 

Axiom  Two,  Strong  Comprehension :  Every  small  subset  of  U  is  represented  in  <11  €>. 

The  above  comprehension  axiom  implies  that  die  cardinality  of  a  subset  C  of  U  determines  whether 
or  not  C  is  represented  in  <U  €>.  If  C  is  small  then  it  is  represented  in  <U  €>,  if  it  is  large  Uicn  it  is  not  I'his 
leads  to  the  following  lemma: 

Lemma  2.1 :  If  two  element  sets  arc  small  then  for  any  small  subsets  C  and  C  of  U  all 
functions  from  C  to  C  arc  represented  in  <U  £>. 

Proof.  An  ordered  pair<x  y>  of  elements  of  U  is  taken  to  be  the  set  {x  {x  y}}.  Thus  if  two  element 
sets  arc  small  any  ordered  pair  of  elements  of  U  is  represented  in  <U  €>.  Any  function  from  C  to 
C’  is  a  set  of  such  pairs  with  the  same  cardinality  as  C  and  is  therefore  a  small  subset  of  U. 

One  model  of  the  above  comprehension  axiom  is  a  universe  <U  €>  where  U  is  infinite  and  a 
non-empty  subset  C  of  U  is  small  just  in  case  it  has  less  than  seven  members.  To  rule  out  such  a  universe 
some  further  axioms  are  needed. 

Axiom  Three.  Infinity:  There  exists  an  element  of  U  which  represents  a  countably  infinite 
subset  ofU. 

Axiom  Four.  Power  Set:  If  a  subset  C  of  U  is  small  then  any  subset  of  U  with  the  cardinality 
of  the  power  set  of  C  is  also  small. 

Axiom  Five.  Union:  A  small  union  of  small  sets  is  small,  i.c.  for  any  family  F  of  subsets  of  U 
if  F  is  small  (has  cardinality  less  dian  or  equal  to  some  small  subset  of  U)  and  if  each  set  in  F 
is  small  then  the  union  of  all  sets  in  F  is  also  small. 

There  is  one  final  comprehension  axiom  which  docs  not  correspond  to  any  axiom  of  ZF  set  theory. 
This  final  axiom  will  turn  out  to  be  important  in  later  sections. 

Definition:  P(U)  is  the  set  of  all  points  in  <U  €>. 

Axiom  Six,  Point  Comprehension:  P(U)  is  large. 

Ihis  axiom  implies  that  for  any  small  subset  C  of  U  there  is  a  set  of  jwints  C'  which  is  the  same  size 
as  C.  Since  both  C  and  C  arc  small  lemma  2.1  implies  dial  the  bijcctions  (one  to  one  onto  functions)  from  C 
to  C  are  represented  in  <U  €>.  Thus  any  small  set  can  be  "identified"  with  a  set  of  points. 
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2.3.  The  Foundation  Axiom 

The  Foundation  axiom  is  the  final  axiom  of  the  theory  of  symmetric  sets.  It  states  that  there  are  no 
infinitely  decreasing  membership  chains. 

Axiom  Seven,  Foundation'.  There  is  no  infinitely  decreasing  sequence  of  elements  of  U,  i.e. 

there  is  no  infinite  sequence  x^3  X29xj3  ... 

The  foundation  axiom  has  important  implications.  It  implies  that  there  is  no  element  x  of  U  such 
that  x£x  (otherwise  the  infinite  sequence  all  of  w'hosc  elements  were  x  would  be  an  infinite  decreasing 
sequence).  In  fact  there  can  be  no  containment  loops,  i.e.  no  sequence  x^€x2£  ...  xn  such  that  xn£xj.  ITe 
foundation  axiom  is  equivalent  to  the  statement  that  every  subset  C  of  U  contains  a  lower  bound  under  £,  i.e. 
any  subset  C  of  U  contains  a  lower  bound  x  such  that  there  is  no  y  in  C  such  that  y€x.  The  foundation  axiom 
can  also  be  characterized  in  terms  of  the  transitive  closure  of  €. 

Definition :  The  binary  relation  £  +  is  defined  to  be  the  transitive  closure  of  £,  i.e.  for  any 

two  elements  x  and  y  of  U,  y£  +  x  just  in  case  y€x  or  there  is  some  finite  sequence  z.j,  Z2>  ... 

zn  such  that  yCzj€z2€  ...  znCx. 

The  foundation  axiom  ensures  that  the  relation  £  +  is  a  partial  order  on  U  and  dial  £  +  is  well 
founded.  The  fact  that  £  +  is  a  well  founded  partial  order  on  U  allows  one  to  define  functions  on  U  by 
recursion  on  £  +  .  For  example  it  is  possible  to  define  a  function  P  which  maps  every  element  of  U  to  its 
underlying  set  of  points.  This  function  is  defined  by  recursion  on  £  +  as  follows: 

Definition  of  the  Junction  P: 

P(q)  =  {q}  for  any  point  q 
P(x)  =  Uy€xP(y)  for  any  non-point  x 

Thus  for  any  element  x  of  U  we  can  talk  about  the  points  P(x)  of  the  element  x.  Note  that  P(x)  is 
always  a  subset  of  U  rather  than  being  any  particular  element  of  U  (in  fact  P(x)  is  always  a  subset  of  P(U),  the 
set  of  all  points).  It  is  not  immediately  obvious  that  for  any  element  x  of  U  the  set  P(x)  is  small  and  therefore 
represented  in  <U  £>.  However  this  does  follow  from  the  axioms  presented  so  far. 

Theorem  2.2:  For  my  element  x  of  U,  the  set  P(x)  is  small. 

Proof.  The  proof  is  by  induction  on  £  +  .  For  a  point  p  the  set  l’(p)  is  just  {p}  which  is  clearly 
small.  Now  considci  any  element  x  of  U  which  is  not  a  point  and  such  diat  for  every  y  such  that 
y£  +x.  P(y)  is  small.  I’(x)  equals  die  union  over  y€x  ol'P(y)  and  therefore  P(x)  is  a  small  union  of 
small  sets  and  must  be  small. 
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3.  ISOMORPHISMS 

A  point  is  that  which  has  no  part. 

-  Euclid 


It  seems  that  every  precisely  defined  object  has  a  natural  notion  of  isomorphism  associated  with  it. 
For  example  graphs,  context  free  grammars,  Turing  machines,  and  topological  spaces  all  have  a  natural 
associated  notion  of  isomorphism.  All  of  these  notions  of  isomorphism  arc  based  on  identifications  between 
die  points  of  one  object  and  the  points  of  another.  This  observation  motivates  a  notion  of  structural 
isomorphism  defined  for  arbitrary  elements  of  die  universe  <U  £>  of  mathematical  objects. 

It  is  intuitively  clear  that  isomorphic  madicmatical  objects  are  in  some  sense  identical.  The  strongest 
sense  in  which  two  elements  x  and  y  of  U  can  be  identical  is  if  there  is  some  symmetry  (automorphism)  of 
<U  £>  which  maps  x  to  y.  It  turns  out  that  under  the  aforementioned  notion  of  structural  isomorphism  two 
objects  arc  structurally  isomorphic  just  in  case  there  is  a  symmetry  of  <L  £>  which  maps  one  to  die  other. 

The  notion  of  isomorphism  can  be  generalized  to  take  into  account  an  arbitrary  but  fixed  context. 
At  one  level  any  two  points  p  and  q  are  isomorphic.  However  if  p  and  q  appear  in  some  fixed  context  then  p 
and  q  need  not  he  considered  isomorphic.  For  example  p  and  q  arc  isomorphic  in  die  context  of  the  set 
{p  q  {rs}}  but  they  arc  not  isomorphic  in  the  context  of  die  set  {p  {q  r}}.  It  turns  out  diat  for  any  element  z 
of  U  which  is  taken  as  a  fixed  context  there  is  a  natural  and  general  definition  for  when  tw'o  elements  of  U  are 
isomorphic  in  the  context  of  z. 

The  universe  <U  O  is  intended  to  be  a  model  of  the  universe  of  all  mathematical  objects.  For  this 
reason  die  term  "object"  will  be  used  as  a  synonym  for  the  phrase  "an  clement  of  U". 

3.1.  The  Symmetries  of  <U  O 


An  automorphism  or  symmetry  of  <U  £>  is  a  one  to  one  onto  map  p  from  U  to  U  (a  permutation  of 
U)  swell  diat  for  any  x  and  y  in  U.  p(x)€p(y)  just  in  case  x€y.  It  can  be  shown  that  for  any  symmetry  p  of 
<U  O  and  any  object  x,  p(x)  is  a  point  just  in  case  x  is  a  point.  Thus  for  any  symmetry  p  of  <U  O  the 
restriction  p|l*(U)  of  p  to  the  points  IXU)  is  a  one  to  one  onto  map  from  P(U)  to  P(U),  i.c.  p|P(U)  is  a 
permutation  of  P(U).  Hie  first  important  theorem  concerning  the  symmetries  of  <U  C>  is  diat  each  symmetry 
is  determined  by  its  corresponding  permutation  of  P(U). 

Theorem  3.1 :  If  p  and  p  are  two  symmetries  of  <U  €>  such  that  pJI’(U)  equals  p'|P(U)  dicn 
p  equals  p'. 

Proof.  The  proof  is  h>  induction  on  £  *  .  By  assumption  p  and  p'  are  the  same  function  on  points. 

Consider  any  x  in  l '  such  that  p  and  p'  have  die  same  value  on  all  elements  y  oi  l  i  such  that  yC  +  x. 

Since  p  preserves  the  membership  relation  the  set  represented  b>  p(\)  equals  ip(v):  ytx}. 

Similarly  the  set  represented  by  p'(\)  equals  {p  (>):  yCx}.  Hut  since  p  and  p'  aie  the  same  function 
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on  all  y€  +  x,  p(x)  and  p’(x)  must  represent  the  same  set.  Thus  by  extcnsionality  p(x)  must  equal 
p'(x). 

Any  symmetry  of  <U  €>  determines  a  permutation  of  P(U)  and  theorem  3.1  shows  that  the  induced 
permutation  of  P(U)  uniquely  determines  the  symmetry.  It  can  also  be  shown  that  every  permutation  of  P(U) 
corresponds  to  a  symmetry  of  <U  €>. 

Theorem  3.2 :  Any  permutation  p  of  P(U)  can  be  extended  to  a  symmetry  of  <U  €>. 

Proof.  I.et  p  be  any  permutation  of  P(U).  The  extension  of  p  to  all  of  U  is  defined  by  induction  on 
€+  via  the  following  relation: 

p(x)  =  the  representation  of  (p(y):  y€x} 

The  set  (p(y):  y£x}  is  guaranteed  to  be  represented  in  <U  £>  because  it  can  be  no  larger  than  the 
set  represented  by  x.  It  follows  from  die  above  equation  that  if  y€x  then  p(y)£p(x)  and  further  if 
p(y)€p(x)  then  y€x.  It  remains  only  to  show  that  the  extension  of  p  to  all  of  U  is  one  to  one  and 

onto.  Consider  the  inverse  permutation  p'^  of  P(U)  and  die  extension  of  this  inverse  to  all  of  U.  It 
can  be  shown  by  a  standard  induction  on  €+  that  p"^(p(x))  equals  x  for  all  x  in  U  and  thus  the 

extension  of  p  is  one  to  one.  Similarly  it  can  be  shown  that  p(p’*(x;)  must  equal  x  and  thus  the 
extension  of  p  is  onto. 

Theorems  3.1  and  3.2  imply  that  there  is  a  natural  one  to  one  relationship  between  die  permutations 
of  the  points  P(U)  and  die  symmetries  (automorphisms)  of  <U  €>.  In  classical  set  theory  there  is  only  one 
point  and  there  is  only  one  symmetry  of  <U  €>,  namely  the  identity  function. 

If  there  is  a  symmetry  of  <U  €>  which  maps  x  to  y  then  x  and  y  arc  truly  indistinguishable.  More 
concretely  let  <h  be  any  first  order  formula  of  one  free  variable  whose  only  non-logical  symbol  is  £.  If  there  is 
a  symmetry  of  <U  €>  which  maps  x  to  y  then  <h(x)  holds  in  <U  €>  just  in  ease  «l>(y)  holds  in  <U  €>. 

Theorem  2.2  says  that  for  any  object  x  the  set  P(x)  is  small  and  thus  P  can  be  thought  of  as  a 
mapping  from  U  to  U.  A  simple  induction  on  £  +  can  be  used  to  show  that  the  mapping  P  commutes  with 
symmetries  of  <U  O,  i.e.  that  for  any  symmetry  p  of  <U  €>  and  any  object  x,  P(p(x))  equals  p(P(x)). 

Lemma  3.3:  For  any  symmetry  p  of  <U  €>  and  any  object  x,  P(p(x))  equals  p(P(x». 

3.2.  Structural  Isomorphisms 


There  is  a  natural  definition  for  what  it  means  for  two  graphs,  or  languages,  or  lists  of  points  to  be 
isomorphic.  All  of  these  objects  can  be  represented  by  elements  of  U  and  it  would  be  nice  if  the  notion  of 
isomorphism  which  is  defined  for  elements  of  U  corresponded  to  the  natural  notion  of  isomorphism  for  such 
objects.  This  observation  leads  to  the  definition  of  structural  isomorphism  presented  below. 

Any  two  points  have  the  same  structure  simply  because  neither  has  any  structure.  I  arger  objects  are 
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structurally  isomorphic  just  in  case  there  is  an  identification  between  the  points' of  the  objects  which  preserves 
Lite  structure  of  those  objects,  For  example  let  p,  q.  r.  and  s  be  any  four  distinct  points.  The  set  { p  q }  is 
isomorphic  to  the  set  {rs},  but  not  the  set  {rsp}.  In  fact  any  two  sets  of  points  .ire  isomorphic  just  in  ease 
they  have  the  same  number  of  elements.  Ihc  triple  <p  p  q>  is  isomorphic  to  die  triple  <r  r  s>  but  not  to  the 
triple  <r  s  s>. 

To  define  die  notion  of  structural  isomorphism  precisely  it  is  necessary  to  build  up  some 
terminology.  Let  C  be  any  set  of  points.  U(C)  is  defined  to  be  the  set  of  all  elements  x  of  U  such  diat  P(x)  is  a 
subset  of  C.  Thus  U(C)  is  the  set  of  objects  w  hich  arc  built  up  out  of  die  points  in  C.  Let  p  be  any  function 
mapping  C  to  arbitrary  points.  Any  such  function  p  can  be  extended  to  a  function  p’  defined  on  all  of  U(C) 
via  the  following  inductive  definition: 


p’( p)  =  p(p)  for  points  p 

p'(z)  =  Flic  representation  of  {p'(y):  y€z}  for  any  non-point  /.  in  U(C). 

For  example  if  p(p)  is  r  and  p(q)  is  s  then  p'(< p  q>)  is  <r  s>.  Thus  die  function  p’  "replaces"  the 
points  of  an  object  by  their  image  under  p.  '■  the  inductive  definition  of  p’(z)  the  set  {p’(y):  y€z}  is 

guaranteed  to  be  represented  in  <U  O  because  it  can  be  no  larger  dian  the  set  represented  by  z.  In  die 

following  discussion  any  function  p  defined  on  the  points  C  will  be  assumed  to  be  defined  in  the  above  way 
on  all  ofU(C). 

Definition:  A  structural  isomorphism  between  two  elements  x  and  y  of  U  is  a  bijcction  p 
from  P(x)  to  P(y)  such  that  p(x)  equals  y  (any  function  defined  on  P(x)  will  be  assumed  to  be 

defined  on  x  via  the  above  relation).  Flic  elements  x  and  y  arc  said  to  be  structurally 

isomorphic  just  in  case  dicrc  exists  a  structural  isomorphism  betw  een  them. 

As  an  example  let  a  group  <G  °>  be  a  pair  of  a  set  of  points  G  and  a  function  °  from  GXG  to  G 
satisfying  the  standard  axioms  for  a  group.  Notice  that  P(<G  °>)  equals  G.  Now  consider  two  group*'  <G  °> 
and  <G'  °  >  and  let  p  be  any  bijcction  from  G  to  G'.  Clearly  p(G)  equals  G’  so  p(<G  °>)  equals  <G'  p(°)>. 
Thus  the  bijcction  p  is  a  structural  isomorphism  between  <G  °>  and  <G’  °’>  just  in  case  p(°)  equals  °\  Since 
functions  arc  represented  by  sets  of  tuples  an  element  of  °  is  a  triple  of  points  <p  q  r>  where  r  is  the  value  of 
p°q.  Thus  p(°)  is  a  set  of  triples  of  the  form  <p(p)  p(q)  p(r)>.  Flic  set  of  triples  p(°)  will  equal  the  set  of 
triples  °'  just  in  case  for  any  triple  <p  q  r>  in  °  the  triple  <p(p)  p(q)  p(r)>  is  in  *>',  i.e.  just  in  ease  p(r)  equals 
p(p)°'p(q).  This  statement  is  equivalent  to  the  condition  that  p(p)°'p(q)  equals  p(p°q)  which  corresponds  to 
die  standard  notion  of  isomorphism  between  groups. 

Flic  notion  of  a  structural  isomorphism  can  be  related  to  the  symmetries  of  <U  C>.  It  has  already 
been  shown  that  there  is  a  natural  one  to  one  correspondence  between  the  automorphisms  of  <U  O  and  the 
permutations  of  P(U).  File  following  theorem  relates  bijections  between  sets  of  points  and  permutations  of 
l’(U). 
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Theorem  14:  Any  Injection  between  two  small  sets  of  points  can  be  extended  to  a 
permutation  of  all  of  P(U). 

Proof  Sketch:  l.ct  C  and  C'  be  any  two  small  sets  of  points  and  let  p  be  any  Injection  from  C  to  C'. 
I.et  p  be  any  member  of  either  C  or  C\  Since  C  and  C  need  not  be  disjoint  p  may  be  in  both  C  and 

C'  and  thus  both  p(p)  and  p’Hp)  may  he  defined.  In  general  any  point  p  in  either  C  or  C'  is 

contained  in  some  minimal  chain  of  the  form  ...  p"'(p"'(p)). p  '(p).  p,  p(p).  p(p(p))...  (more 
precisely  the  minimal  chain  containing  p  is  the  least  subset  of  CUC"  which  contains  p  and  is  closed 

under  p  and  p"').  The  minimal  chain  containing  p  can  be  one  of  four  types.  First  it  might  be  a 
loop,  in  which  case  p  is  already  a  permutation  of  the  minimal  chain  containing  p.  Second  it  might 
be  infinite  in  both  direction  in  which  ease  p  is  also  already  a  permutation  of  the  chain.  Iltird  the 
chain  may  have  a  "first"  member  which  is  in  C  but  not  in  C'  and  a  last  member  which  is  in  C’  but 
not  in  C.  Finally  die  minimal  chain  containing  p  may  have  only  one  endpoint,  either  a  starting 
point  or  an  ending  point.  In  these  eases  p  is  not  a  permutation  of  the  minimal  chain  containing  p. 
To  remedy  this  situation  one  can  extend  the  function  p  to  more  points  and  convert  any  chain  of  the 
these  last  types  into  either  a  cycle  or  a  chain  which  is  infinite  in  both  directions.  To  make  such  an 
extension  there  must  be  enough  points  in  P(U)  which  arc  not  in  C  or  C'.  Hut  since  both  C  and  C' 
arc  assumed  to  be  small  this  last  condition  can  be  readily  shown. 

The  main  result  of  this  section  can  now  be  proven. 


Theorem  3.5  Two  elements  x  and  y  of  U  are  structurally  isomorphic  just  in  case  there  is  a 
symmetry  of  <U  €>  which  maps  x  to  y. 

Proof.  If  x  and  y  arc  structurally  isomorphic  then  there  is  a  bijection  p  from  P(x)  to  P(y)  which 
maps  x  to  y.  Any  extension  of  p  to  more  points  w  ill  still  map  x  to  y.  Thus  theorem  3.4  implies  that 
p  can  be  extended  to  a  permutation  of  P(U)  which  maps  x  to  v.  On  the  other  hand  if  there  is  a 
permutation  p  of  P(U)  which  maps  x  to  y  lemma  3.3  implies  that  P(p(x))  equals  p(P(x))  so  p( P(x» 
equals  P(y)  and  thus  p  maps  P(x)  onto  P(y).  Thus  the  restriction  of  p  to  P(x)  is  a  bijection  from  P(x) 
to  P(y)  which  maps  x  to  y. 

Theorem  3.5  demonstrates  that  the  two  natural  notions  of  isomorphism  between  symmetric  sets 
coincide.  Thus  there  is  never  any  ambiguity  in  what  is  meant  by  two  elements  of  U  being  isomorphic. 


.3.3.  Symmetry  and  Contextual  Isomorphisms 


Since  there  is  a  natural  notion  of  isomorphism  for  elements  of  U  there  is  also  a  natural  notion  of 
automorphism  or  symmetry.  For  example  there  arc  two  structural  symmetries  of  {p  q}.  the  identity  map  on 
the  points  p  and  q  and  the  function  which  exchanges  p  and  q.  Since  any  structural  isomorphism  can  be 
extended  to  a  symmetry  of  <U  €>  there  is  no  need  to  distinguish  between  structural  isomorphisms  and 
permutations  ofl’(lJ). 

Definition:  I  he  symmetry  group  of  an  object  x,  denoted  A(x),  is  the  set  of  all  permutations 
p  olT(U)  such  that  p(x)  equals  x. 

I  here  is  a  contextual  notion  of  isomorphism  where  two  objects  x  and  y  are  isomorphic  in  a  context  / 
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just  in  case  x  and  y  bear  exactly  the  same  relationship  to  /.  For  example  let  /  be  the  set  {p  q  {r  s}}  where  p,  q, 
r,  and  s  arc  points.  Clearly  the  point  p  is  just  like  the  point  q  with  respect  to  z  (both  p  and  q  arc  members  of  i 
and  neither  is  a  member  of  a  member  of  /.).  Similarly  die  point  r  is  just  like  the  point  s  with  respect  to  /.  On 
the  other  hand  p  is  not  like  the  point  r  since  p  is  a  member  of  z  while  r  is  not. 

As  another  example  let  y  be  the  set  (<p  q>  <q  r>  <r  p>}.  This  represents  a  directed  graph  with  nodes 
p,  q.  and  r  and  edges  from  p  to  q.  q  to  r,  and  r  to  p.  In  other  words  y  represents  a  cyclic  directed  graph  of 
direc  nodes.  Notice  that  every  node  of  y  looks  like  every  other  node.  More  precisely  y  has  three  structural 
symmetries  corresponding  to  three  rotations  of  die  graph.  For  any  two  nodes  there  is  a  rotation  which  maps 
one  to  the  other. 

Definition:  Two  objects  x  and  y  will  be  called  isomorphic  in  the  context  of  an  object  i  just  in 
case  there  is  a  symmetry  p  of  z  such  that  p(x)  equals  y.  The  set  of  things  isomorphic  to  y  in 
die  context  of  z  will  be  denoted  |y|z. 


A  > 
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4.  ABSTRACTION,  RKPRFSENTA' TION,  AND  OTHER  APPLICATIONS 


There  is  an  informal  distinction  in  mathematics  between  essential  and  contextual  properties.  For 
example  consider  an  open  set  of  some  topological  space.  The  fact  that  the  set  is  open  is  a  contextual  property 
of  that  set  while  the  cardinality  of  the  set  is  an  essential  property.  In  the  framework  of  symmetric  sets  this 
distinction  is  easily  made  precise.  Any  two  isomorphic  objects  have  the  same  essential  properties.  An 
essential  predicate  $  is  any  predicate  on  U  such  that  <l>(x)  is  equivalent  to  0(p(x))  for  any  object  x  and  point 
permutation  p.  An  essential  function  is  one  that  commutes  with  automorphisms  of  <U  O,  i.e.  F'(p(v))  always 
equals  p(F(x)). 

In  mathematics  one  often  encounters  a  notion  of  a  "canonical"  or  "natural"  transformation  or 
relationship.  For  example  a  set  of  points  has  no  natural  or  canonical  element,  there  is  no  natural  or  canonical 
point  on  a  circle,  and  a  square  has  no  canonical  comer.  On  the  other  hand  one  can  choose  a  canonical 
element  of  an  ordered  pair.  One  particularly  well  known  example  is  the  dual  space  of  a  linear  vector  space. 
The  dual  space  D(X)  of  a  linear  vector  space  X  is  die  set  of  linear  functions  from  X  to  scalars.  If  there  is  a  dot 
product  operation  *  defined  on  X  then  there  is  a  natural  isomorphism  between  X  and  D(X)  where  the  linear 
function  associated  with  a  vector  x  is  Xy.x'y.  However  if  no  dot  product  operation  is  specified  for  X  dicn 
while  X  and  IXX)  arc  still  isomorphic  there  is  no  natural  or  canonical  isomorphism.  On  the  other  hand  dierc 
is  always  a  canonical  or  natural  isomorphism  between  X  and  DOXX)).  A  simple  and  natural  definition  for 
diis  notion  of  canonical  is  given  by  saying  diat  y  is  canonical  in  the  context  of  x  just  in  case  die  isomorphism 
class  of  y  in  the  context  of  x,  |y|x,  contains  only  one  object. 

Hie  theory  of  symmetric  sets  provides  a  simple  general  measure  of  the  abstractness  of  objects.  An 
object  y  can  be  said  to  be  an  abstraction  of  an  object  x  just  in  ease  any  one  of  the  following  three  conditions 
hold:  A(y)  contains  A(x).  |y|x  is  a  singleton  set,  or  y  equals  F'(x)  for  some  essential  function  F.  It  turns  out  that 
these  direc  criterion  arc  equivalent  and  there  is  no  ambiguity  in  what  is  meant  by  y  being  an  abstraction  of  x. 

There  arc  many  representation  theorems  in  mathematics.  For  example  every  Boolean  algebra  can 
be  represented  by  an  algebra  of  sets.  Of  course  the  notion  of  a  representation  is  also  heavily  used  in  computer 
science  where  alphabets  are  represented  as  binary  codes  and  sets  are  represented  as  lists.  This  raises  die 
natural  question  of  what  is  meant  in  general  by  a  representation.  The  dicory  of  symmetric  sets  provides  a 
natural  framework  in  which  to  develop  a  general  theory  of  representation. 

Mathematicians  often  talk  about  fixed  but  arbitrary  structures  which  form  n  context  in  which  to 
investigate  other  structures.  For  example  the  natural  numbers  arc  assumed  to  be  a  fixed  set  even  (hough  dieir 
"true  identity"  can  never  be  specified.  The  same  holds  for  die  real  and  complex  numbers.  Another  example 
of  contextual  objects  arc  the  fixed  constants  "true"  and  "false"  which  are  used  in  discussions  of  logic.  Still 
another  example  from  logic  is  die  fixed  but  arbitrary  alphabet  from  which  the  sentences  of  logic  arc 
constructed,  liven  the  "empty  set”  can  be  viewed  as  an  object  which  is  taken  to  be  fixed  but  whose  identity  is 
never  specified.  The  general  notion  of  a  context  can  be  handled  in  a  natural  way  in  the  theory  of  symmetric 
sets.  A  context  is  an  object  /.  (which  may  have  lots  of  internal  structure)  which  is  "taken  to  be  fixed".  This 
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means  that  only  symmetries  of  z.  arc  considered  when  talking  about  isomorphisms,  symmetry  groups,  essential 
properties,  and  canonical  objects. 

4.1.  Essential  Functions,  Canonicality,  and  Abstraction 


The  set  {p  q}  can  be  thought  of  as  essential  property  of  the  pair  <p  q>.  That  is  to  say  that  given  a 
pair  of  points  one  can  derive  in  a  natural  way  a  set  of  two  points.  The  reverse  does  not  seem  to  hold,  given  a 
set  of  two  points  there  is  no  natural  or  canonical  way  to  derive  a  pair  of  two  points.  Similarly  given  a  point  p 
there  is  no  natural  or  canonical  set  of  two  points  which  contains  p.  Conversely  given  a  set  of  two  points  there 
is  no  natural  or  canonical  element  of  that  set.  Recall  that  for  any  objects  y  and  z,  |y|z  is  die  set  of  all  things 
isomorphic  to  y  in  the  context  of  z,  i.e.  |y|z  is  the  set  of  things  which  can  be  written  as  p(y)  for  some  symmetry 
p  of  z. 

Definition :  An  object  y  will  be  called  canonical  in  the  context  of  an  object  z  just  in  ease  |y|z 
is  a  singleton  set 

Iherc  arc  many  intuitive  examples  of  objects  which  are  not  natural  or  canonical.  There  is  no  natural 
or  canonical  point  on  the  perimeter  of  a  circle.  There  is  no  canonical  corner  on  a  square.  There  is  no  natural 
or  canonical  coordinate  system  for  three  dimensional  space.  On  the  other  hand  consider  an  oblique  triangle 
(where  no  two  sides  have  the  same  length).  One  can  choose  a  canonical  vertex  for  such  a  triangle  by  choosing 
the  vertex  connecting  the  two  shorter  sides. 

There  is  another  intuitively  satisfying  notion  of  what  a  canonical  object  is.  Intuitively  y  is  canonical 
in  the  context  of  z  if  one  can  define  a  function  which  takes  z  and  unambiguously  returns  y.  This  notion  of  a 
canonical  object  is  problematic  because  for  any  two  objects  z  and  y  there  is  a  function  which  maps  z  to  y. 
However  one  wants  a  definable  function,  Remember  that  the  universe  <U  O  has  many  non-trivial 
automorphisms  and  every  function  from  LI  to  U  which  is  defined  in  terms  of  the  structure  of  the  universe 
<U  O  must  respect  those  automorphisms.  In  particular  any  function  F  defined  in  terms  of  the  structure  of 
<U  €>  must  be  essential  in  the  following  sense: 

Definition'.  A  function  F  from  U  to  U  will  be  called  essential  just  in  case  for  every  symmetry 
p  of  <U  O  and  every  element  x  of  U,  l  (p(x»  equals  p(l-'(x)). 

Intuitively  die  image  of  an  object  x  under  an  essential  function  F  can  contain  no  more  information 
than  the  object  x  itself.  Thus  it  might  be  said  diat  F(x)  is  an  abstraction  of  x. 

Definition'.  An  object  y  will  be  called  an  abstraction  of  an  object  /.just  in  ease  y  equals  F(z) 
for  some  essential  function  F. 


The  main  icsult  of  this  section  relates  the  (he  notion  of  canonicality,  the  notion  of  abstraction,  and  die 
symmetry  groups  of  objects. 
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Theorem  4.1,  The  First  Abstraction  Theorem :  For  any  objects  y  and  z  the  following  arc 
equivalent: 

1)  A(y)  contains  A(z). 

2)  |y|z  is  a  singleton  set. 

3)  y  equals  F(z)  for  some  essential  function  F. 

Proof. 

1<  =  >2:  If  A(y)  contains  A(z)  then  for  any  symmetry  p  of  z,  p(y)  equals  y  so  |y|z  contains  only 

y.  On  the  other  hand  if  |y|z  is  a  singleton  set  then  p(y)  equals  y  for  any  symmetry  p  of  /.  so  A(z)  is  a 
subset  of  A(y). 

1  =  >3:  Given  that  A(y)  contains  A(z)  an  essential  function  F  from  U  to  U  can  be  defined  as 
follows:  Of  course  F(z)  is  defined  to  be  y.  For  any  w  which  is  isomorphic  to  /.  let  p  be  some  point 
pennutation  such  that  p(z)  equals  w  and  define  F(w)  to  be  p(y)  (this  is  equivalent  to  defining 
F(p(z))  to  be  p( !-(/.))  for  any  point  permutation  p).  It  must  be  shown  that  this  definition  of  F(w)  is 
independent  of  the  choice  of  p.  In  particular  if  p  and  p  are  two  point  permutations  such  that  p(z) 
equals  p'(z)  equals  w  then  it  must  be  shown  that  p(y)  equals  p’(y).  If  p( z)  equals  p'(z)  then 

p'kpV))  equals  z  so  p‘*°p’  is  a  symmetry  of  z.  But  since  A(z)  is  a  subset  of  A(y),  p"*°p’  is  a 

symmetry  of  y  and  thus  p'  Vp’(y))  equals  y  which  implies  that  p’(y)  equals  p(y). 

It  will  now  be  shown  that  for  any  w  which  is  isomorphic  to  z  and  any  point  permutation  p, 
F(p(w))  equals  p(F(w)).  Since  w  is  isomorphic  to  z  there  is  some  point  pennutation  p’  such  that  w 
equals  p'(z).  Now  F(p(w))  equals  F(p°p‘(z))  which  equals  p°p'(F(z))  which  equals  p(p'(F'(z)) 
which  equals  p(F(p’(z))  which  equals  p(F(w)).  Thus  for  any  w  isomorphic  to  z,  F  satisfies  the 
condition  for  being  an  essential  function.  To  complete  the  definition  of  F  let  F(w)  be  w  for  any  w 
not  isomorphic  to  z. 

3  =  >1 :  Suppose  F  is  an  essential  function  such  that  F(z)  equals  y  and  let  p  be  any  symmetry  of 

z.  Since  F  is  an  essential  function  p(F(z))  equals  F(p(z))  which  equals  F(z).  Thus  p  is  a  symmetry 
of  F(z),  i.e.  p  is  a  symmetry  of  y.  Thus  A(y)  contains  A(z). 


4.2.  Abstraction  and  Points 


The  notion  of  abstraction  has  some  important  relationships  to  points.  The  first  lemma  about  points 
concerns  the  isomorphism  classes  of  points  in  the  context  of  an  object  x. 

I.emma  4.2\  For  any  object  x  and  point  p,  |p|x  is  either  a  subset  of  P(x)  or  is  all  of  P(U)-P(x). 

Proof.  For  any  symmetry  p  of  x.  p(P(x))  equals  P(p(x))  equals  P(x).  Thus  any  symmetry  p  of  x 
induces  a  pennutation  of  P(x).  Ihus  for  any  point  p  in  P(x).  |p|  is  a  subset  of  P(x).  On  the  other 

hand  for  any  two  points  r  and  s  which  arc  not  in  P(x)  there  is  a  symmetry  of  x  which  exchanges  r 
and  s.  Thus  if  r  is  not  in  l\x)  then  |r|x  contains  all  of  P(U)-P(x).  Furthermore  for  r  not  in  P(x),  |r|x 

can  not  intersect  P(x)  since  otherwise  there  would  be  some  point  p  in  P(x)  such  that  |p|x  was  not  a 
subset  of  P(x).  I  hus  cither  p  is  in  P(x)  and  |p|x  is  a  subset  of  l*(x)  or  p  is  not  in  P(x)  and  |p|  equals 
P(U)-P(x). 


Corollar y  4.3 :  For  any  object  x  and  point  p,  p  is  in  P(x)  just  in  case  |p|x  is  small. 
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Corollary  4.3  immediately  implies  that  for  any  object  x  the  set  P(x)  i<  determined  by  the  symmetry 
group  of  x.  In  fact  Corollary  4.3  leads  directly  to  the  following  theorem: 

Theorem  4.4  For  any  objects  y  and  x,  if  y  is  an  abstraction  of  x  then  P(y)  is  a  subset  of  P(x). 

Proof,  l  et  p  be  any  point  in  P(y).  Since  A(y)  contains  A(x),  |p|y  contains  |p|x.  But  by  Corollary  4.3 
|p|y  is  small  so  |p|x  must  also  be  small  and  therefore  p  must  be  in  P(x). 

Theorem  4.4  immediately  implies  that  for  any  for  any  essential  function  F  and  any  object  x.  P(F(x)) 
is  a  subset  of  P(x).  It  is  important  to  realize  that  the  converse  of  theorem  4.4  docs  not  hold,  i.e.  if  P(y)  is  a 
subset  of  P(x)  then  y  need  not  be  an  abstraction  of  x.  For  example  {p  q}  is  not  an  abstraction  of  {p  q  r}. 

4.3.  Representation  and  Transformational  Isomorphisms 


Any  finite  set  of  points  can  be  represented  by  a  finite  list  of  points  though  there  is  no  canonical 
representation  for  a  set  of  points  as  a  list  of  points.  More  precisely  there  is  an  essential  function  F  which  maps 
any  finite  list  of  points  to  the  finite  set  of  points  contained  in  that  list  and  any  finite  set  y  can  be  w  ritten  as 
F(x)  for  some  finite  list  x.  Note  that  the  function  F  is  from  the  representations  to  die  represented  objects. 
Also  note  that  die  function  from  representations  to  represented  objects  is  onto,  i.e.  every  object  which  is  to  be 
represented  must  have  a  representation.  These  observations  lead  to  die  following  definition  of  a  uniform 
representation. 

Definition:  Let  C  and  R  be  subsets  of  U,  let  F  be  an  essential  function,  and  let  F(R)  denote 
(F(x)  for  x  in  R}.  F  is  said  to  be  a  uniform  representation  of  elements  of  C  as  elements  of  R 
just  in  ease  F(R)  contains  C. 

Note  diat  lists  of  points  can  not  be  represented  as  sets  of  points  because  dicrc  is  no  essential  function 
which  maps  a  set  of  points  to  a  list  of  diose  points  (dicre  is  no  canonical  representation  of  a  set  of  points  as  a 
list).  A  similar  example  involves  multisets.  A  list  of  points  can  be  used  to  represent  a  multiset  of  points,  but 
multisets  of  points  can  not  be  used  to  represent  lists. 

There  arc  certain  eases  in  mathematics  where  two  different  (non-isomorphic)  things  are  "essentially 
the  same  thing".  For  example  an  equivalence  relation  on  a  set  of  points  C  is  a  relation,  i.e.  a  set  of  pairs, 
which  is  reflexive,  symmetric,  and  transitive.  A  partition  of  C  is  a  family  of  disjoint  subsets  of  C.  Any 
equivalence  relation  on  C  can  be  viewed  as  a  partition  ofC  and  vice  versa.  Another  simple  example  involves 
the  representation  of  tuples.  For  example  a  tuple  of  points  <p  q>  can  be  viewed  as  the  set  {p  {p  q}}  or  as  die 
set  {q  {p  q} }.  The  following  definition  makes  the  notion  of  "essentially  die  same”  more  concrete. 
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Definition:  An  essential  function  F  from  U  to  U  which  is  also  a  permutation  of  l)  will  be 
called  a  transformational  symmetry  of  <U  €>.  Two  objects  x  and  y  will  be  called 
transformationally  isomorphic  just  in  case  there  is  a  transformational  symmetry  of  <U  O 
which  maps  x  to  y. 

If  two  objects  x  and  y  arc  transformationally  isomorphic  then  there  is  a  sense  in  which  they  are 
indistinguishable.  More  precisely  the  following  lemma  holds: 


Lemma  4.5:  For  any  transformational  symmetry  F  there  is  a  translation  operator  T  which 
maps  any  monadic  essential  predicate  to  a  monadic  essential  predicate  T(d>)  such  that  for 
any  object  x.  4>(x)  holds  just  in  case  T(4>XF(x))  holds. 

Proof.  Since  F  is  a  permutation  of  U  it  has  an  inverse  F*^  which  is  easily  shown  to  be  an  essential 

function.  Let  T(<t>)  be  the  predicate  Xw.<J>(F’^(w)).  Clearly  4>(x)  is  equivalent  to  T(4>XF(x)). 

Furthermore  T(<t>)  is  easily  seen  to  be  an  essential  predicate. 

Lemma  4.5  gives  a  precise  relationship  between  objects  which  are  transformationally  isomorphic.  In 
particular  if  x  and  y  arc  transformationally  isomorphic  via  the  transformational  symmetry  F  then  any  essential 
statement  (or  question)  4>(x)  concerning  x  is  equivalent  to  some  essential  statement  T(4>X>)  concerning  y.  It 
turns  out  that  two  objects  x  and  y  are  transformationally  isomorphic  just  in  case  A(x)  equals  A(y).  However 
the  condition  that  A(x)  equals  A(y)  does  not  ensure  that  there  is  a  definable  transformational  symmetry  F 
which  maps  x  to  y. 

Definition:  A  function  F  from  U  to  U  will  be  called  definable  just  in  ease  there  is  a  first  order 
formula  <J>  of  two  free  variables  whose  only  non*logical  symbol  is  6  such  that  for  any  two 
objects  x  and  y,  4>(x  y)  holds  in  <U  O  just  in  ease  y  equals  F(x). 

Consider  the  real  numbers  <R  +  *  <>  where  R  is  a  set  of  points,  +  and  *  are  binary  operations  on 
R,  and  <  is  a  total  order  on  R.  A(<R  +  *  <>)  is  the  group  of  all  symmetries  which  leave  every  point  in  R 
fixed  (any  symmetry  of  <R  +  *  <>  must  leave  one  and  zero  fixed).  Now  consider  a  pair  <R  <’>  where  <’  is 
a  well  ordering  of  the  set  of  points  R.  It  is  easy  to  show  that  A(<R  <’>)  is  also  the  group  of  all  permutations 
which  leave  every  point  in  R  fixed.  Since  A(<R  +  *  <>)  equals  A(<R  <‘>),  <R  +  *  <>  is  transformationally 
isomorphic  to  <R  <’>.  However  there  is  probably  no  definable  transformational  symmetry  of  <U  €>  which 
maps<R  +  *  <>  to<R  <j’>. 

4.4.  Context 


Intuitively  a  context  is  a  collection  of  objects  which  arc  taken  to  be  "fixed".  There  arc  some  objects 
which  arc  taken  to  be  fixed  over  all  of  mathematics.  For  example  mathematicians  often  speak  of  ’The” 
natural  numbers,  even  though  the  identity  of  the  natural  numbers  can  not  be  specified  (though  the  structure 
of  die  numbers  can  be  specified  up  to  isomorphism).  The  same  holds  for  "the”  real  numbers,  or  "the” 
complex  plain.  In  logic  one  often  assumes  that  there  is  a  particular  thing  which  is  die  constant  "true"  and  a 
particular  thing  which  is  the  constant  "false".  A  more  controversial  example  is  the  empty  set.  There  are  other 
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examples  of  "context"  where  the  context  is  not  even  specified  up  to  isomorphism.  The  phrase  "fixed  but 
arbitrary"  is  often  used  in  mathematical  writing  and  serves  to  specify  a  context  for  a  mathematical  discussion. 

As  another  example  of  context  consider  a  linear  vector  space.  A  linear  vector  space  has  an  associated 
field  (usually  "the"  real  or  complex  numbers)  such  that  any  vector  can  be  sealed  by  an  element  of  the  field. 
In  discussions  of  linear  vector  spaces  the  field  is  usually  taken  to  be  fixed.  Thus  in  relating  two  vector  spaces 
one  usually  assumes  they  have  the  same  associated  field  of  scalars. 

As  yet  another  example  consider  a  particular  first  order  language  L.  The  language  I.  is  taken  to  be  a 
sequence  of  typed  symbols  which  determines  a  set  of  well  formed  formulas.  Such  a  language  is  usually  taken 
to  be  arbitrary  but  fixed  in  discussions  of  logic. 

To  generalize  the  results  of  the  previous  section  it  is  useful  to  define  the  notion  of  a  contextual 
symmetry  group. 

Definition:  The  symmetry  group  of  y  in  the  context  of  z,  denoted  Az(y),  is  the  set  of  all 
symmetries  of  z  which  arc  also  symmetries  of  y.  More  simply  A  (y)  equals  A(y)  intersect 
A(z). 

Note  that  any  symmetry  of  the  pair  <y  z>  must  be  a  symmetry  of  y  and  a  symmetry  of  z,  and 

anything  which  is  a  symmetry  of  both  y  and  z  must  be  a  symmetry  of  the  pair  <y  z>.  Thus  A(<y  z>)  is  the 

intersection  of  A(y)  and  A(z)  so  Az(y)  equals  A(<y  z>).  The  notion  of  an  essential  function  can  also  be  made 
contextual: 

Definition:  A  function  F  from  U  to  U  will  be  called  essential  in  the  context  of  z  just  in  case 
for  any  object  x  any  symmetry  p  of  z,  F(p(x))  equals  p( F(x)). 

A  good  example  of  a  contextually  essential  function  is  the  cardinality  function  on  finite  sets.  Let 

<N  <>  be  "die”  natural  numbers  where  N  is  a  set  of  points  and  <  is  a  binary  relation  which  orders  those 

points.  The  function  F  which  maps  any  finite  set  x  to  the  natural  number  representing  the  size  of  x  is  essential 
in  die  context  of  <N  <>.  Note  that  diis  cardinality  function  is  not  essential  outside  of  this  context  because 
F(x)  can  be  a  point  not  found  in  x.  The  following  lemma  provides  an  alternative  characterization  of  functions 
which  are  contextually  essential. 

1. annul  4,7:  A  function  F  from  U  to  U  is  essential  in  the  context  of  z  just  in  case  it  can  be 
written  as  \x.G(<x  />)  for  some  essential  function  G. 

Proof.  If  F  can  be  written  as  Ax.G(<x  z>)  then  it  is  easy  to  show  that  F  is  essential  in  the  context  of 
z.  On  the  other  hand  assume  F  is  essential  in  the  context  of  z.  First  if  w  is  not  a  pair  w  hose  second 
component  is  isomorphic  to  z  dien  G(w)  is  defined  to  be  w.  If  w  is  a  pair  whose  second  component 
is  isomorphic  to  z.  then  w  can  be  written  as  p(<x  z>)  for  some  object  x  and  point  permutation  p.  In 
this  ease  G(w).  which  can  be  written  as  G(p(<x  ■/>)),  is  defined  to  be  p(F(x)).  It  must  first  be  shown 
that  G  is  well  defined,  i.c.  diat  if  p(<x  z>)  equals  p’(< y  z»  then  p(l-(x))  equals  p'(F(y)).  First  note 

dial  p  *(p'(<y  z»)  equals  <x  z>  so  p'(p‘(y))  equals  x  and  p’*(p'(z))  equals  z.  T  hus  p  *°p'  is  a 

symmetry  of  z.  Now  since  F  is  essential  in  the  context  of  z.  p*  ^  °p'(F(> ))  equals  l  (p  *  °p'(y))  which 

equals  F(x).  Hut  if  F(x)  equals  p'  *°p'(F(y))  then  p(F(x))  must  equal  p-(F(y)). 
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It  follows  directly  from  the  definition  of  G  dial  G(<x  />)  equals  l  (x)  and  thus  F  can  he  written 
as  \x.G(<x  />).  To  show  that  G  commutes  with  arbitrary  point  permutations  let  w  be  any  object 
and  p  be  any  point  permutation.  If  w  can  not  be  written  as  p'(<x  z>)  for  some  x  and  p"  then  G(w) 
equals  w  and  G(p(w))  equals  p(w)  so  the  result  is  trivial.  On  the  other  hand  suppose  w  can  be 
written  as  p’(<x  z».  In  this  ease  G(p(w))  equals  G(p°p'(<x  /.>))  which  equals  p°p  (T(x))  which 
equals  p(G(p'(<x  z>)))  which  equals  p(G(w)). 

1  he  following  abstraction  theorem  is  a  generalization  of  the  first  abstraction  theorem. 


Theorem  4.8,  The  Second  Abstraction  Theorem'.  For  any  objects  x  and  y  and  context  z  the 
following  arc  equivalent: 

1)  Az(y)  contains  Az(x) 

2)  |y|<x  is  a  singleton  set 

3)  y  equals  F(x)  for  some  function  F  which  is  essential  in  the  context  of  z 

Proof.  The  first  condition  is  equivalent  to  the  statement  that  A(<y  z>)  contains  A(<x  />).  ITie 
second  condition  is  equivalent  to  tire  statement  that  j<y  z>|<x  is  a  singleton  set.  Finally  lemma 

4.7  implies  that  the  third  condition  is  equivalent  to  the  statement  that  <y  z>  equals  G(<x  z>)  for 
some  essential  function  G.  Thus  the  equivalence  of  these  three  statements  follow  s  directly  from  the 
first  abstraction  theorem. 


4.5.  Essential  Predicates 


Essential  functions  have  been  shown  to  play  an  important  role  in  characterizing  the  nature  of 
abstractions  and  the  notion  of  a  natural  or  canonical  property.  Essential  predicates  arc  closely  related  to 
essential  functions  and  can  play  much  the  same  role  in  constructing  abstractions. 

Definition:  A  binary  predicate  4)  on  U  is  called  essential  if  for  any  objects  x  and  y  and  point 
permutation  p,  <I>(x  y)  holds  just  in  ease  <t>(p(x)  p(y))  holds. 

The  relationship  between  predicates  and  functions  can  be  made  more  explicit  by  defining  a  monadic 
function  Fq  for  each  binary  predicate 

Definition:  f  or  any  binary  predicate  4>  on  U,  F*  is  the  function  from  U  to  subsets  of  U  such 
that  F*(x)  equals  {y:  d»(x  y)}. 

For  example  let  «t>  be  the  predicate  such  that  <l>(x  y)  holds  just  in  case  y  is  a  pair  whose  first 
component  is  x.  In  this  case  T^fx)  is  the  set  of  all  pairs  whose  first  component  is  x.  Note  that  F^(x)  is  a  large 
set  and  thus  has  no  representation  in  <U  €>.  Thus  in  general  can  not  be  thought  of  as  a  function  from  U 
to  U.  The  following  definitions  will  be  useful  in  discussing  the  functions  associated  with  essential  predicates. 

Definition:  Let  C  be  any  subset  of  U.  P(C)  is  the  union  over  x  in  C  of  P(x).  For  any  point 
permutation  p,  p(C)  is  the  set  |p(x):  x  in  C}.  The  symmetry  group  of  C,  denoted  A(C),  is 
the  set  of  all  point  permutations  p  such  that  p(C)  equals  C.  The  isomorphism  class  |C|Z  of  C 

in  the  context  of  an  object  /.  is  the  family  of  all  sets  which  can  he  written  as  p(C)  for  some 


C 
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point  permutation  p. 

For  any  essential  predicate  $  the  function  •'<!>  commutes  with  point  permutations,  i.e.  F^(p(x)) 
equals  p(F<k(x)).  Thus  F^  can  be  thought  of  as  an  essential  function.  However  .is  the  above  example  shows 
F(1,(x)  can  be  large  and  PfF^fx))  can  be  all  of  P(U).  Thus  some  of  die  theorems  concerning  essential 
functions  do  not  apply  to  F^.  However  many  of  the  results  concerning  essential  functions  can  be  generalized 
to  die  functions  associated  with  essential  predicates, 

Theorem  4.0,  The  Third  Abstraction  Theorem :  For  any  object  z  and  (possibly  large)  subset  C 

of  U  the  following  arc  equivalent: 

1)  A(C)  contains  A(z) 

2)  |C|2  is  a  singleton  family 

3)  C  equals  F*(/.)  for  some  essential  predicate  d». 

The  proof  of  the  above  theorem  is  analogous  to  die  proof  of  the  first  abstraction  dicorcin.  In 
showing  that  1)  implies  3)  the  predicate  <J>  is  defined  by  setting  «b(p(z)  p(y))  to  be  true  for  any  point 
permutation  p  and  any  element  y  of  C,  and  <l>( w  x)  to  be  false  if  <w  x>  can  not  be  written  as  <p(z)  p(y)>  for 
some  y  in  C. 

F.sscntial  predicates  can  be  diought  of  as  defining  abstraction  functions  from  objects  to  more 
abstract  objects.  For  an  essential  predicate  $  and  object  x  it  does  not  seem  very  important  that  F^x)  may  be 
large,  the  important  point  is  that  the  symmetry  group  of  F^fx)  contains  the  symmetry  group  of  x. 

A  good  example  of  the  use  of  essential  predicates  in  defining  abstractions  is  a  multiset.  I  .ct  f  and  g 
be  two  finite  functions  (they  each  have  a  finite  domain).  Hie  functions  f  and  g  will  be  said  to  represent  the 
same  multiset  if  there  exists  a  bijcction  a  from  die  domain  of  f  to  the  domain  of  g  such  diat  for  any  x  in  the 
domain  of  f,  t(x)  equals  g(a(x)).  Intuitively  f  and  g  represent  the  same  multiset  if  for  any  range  clement  y,  f 
and  g  map  the  same  number  of  objects  onto  y.  Let  <1>  be  the  binary  predicate  on  U  such  diat  <t>(f  g)  holds  just 
in  case  f  and  g  arc  finite  functions  which  represent  die  same  multiset.  It  is  easy  to  show  that  <l>  is  an 
equivalence  relation  on  finite  functions  and  that  F',|>(0  is  the  equivalence  class  of  f  under  diis  relation.  The 
symmetry  group  of  Fa(0  is  not  die  full  permutation  group  on  P(U)  but  is  larger  than  the  symmetry  group  of 
f.  Thus  F<j,(  f)  can  be  diought  of  as  die  multiset  abstraction  of  f. 
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5.  RELATION  10  01  TIER  WORK 


The  notions  of  isomorphism,  symmetry,  and  representation  arc  ubiquitous  in  mathematics  and 
probably  have  numerous  independent  origins.  The  relationship  between  symmetry  and  permutation  groups 
is  also  well  known.  This  relationship  has  been  studied  in  some  detail  and  it  has  been  shown,  for  example,  that 
not  every  permutation  group  can  be  represented  as  the  symmetry  group  of  a  graph  [Biggs  74].  But  while  the 
notions  of  isomorphism  and  symmetry  have  been  extensively  used  for  objects  of  a  given  type  (c.g.  graphs, 
groups,  algebras,  languages,  grammars)  these  particular  notions  of  isomorphism  do  not  provide  a  notion  of 
isomorphism  defined  over  all  mathematical  objects. 

Category  theory  provides  one  general  approach  to  the  notion  of  isomorphism.  A  category  can  be 
thought  of  as  a  directed  multigraph  with  an  associative  composition  operator  °  on  arcs  and  for  each  node  n  an 
assigned  "identity”  arc  from  n  to  n.  The  nodes  of  a  category  arc  often  associated  with  sets  and  the  arcs  with 
functions  between  these  sets.  Thus  die  arcs  are  called  "morphisms".  An  isomorphism  is  defined  to  be  an  arc 
p  which  has  an  "inverse"  arc  p"*  such  that  both  p°p  *  and  p'^°p  arc  identity  arcs  [Schubert  72],  Category 
theory  provides  a  general  theory  of  isomorphism  to  the  extent  that  every  mathematical  object  can  be  thought 
of  as  a  node  ("object")  in  a  category.  For  example  a  group  can  be  thought  of  as  a  node  in  the  category  of 
groups,  a  graph  as  a  node  in  the  category  of  graphs,  etc.  However  the  category  containing  a  given  object  must 
be  defined  separately  for  each  type  of  object.  In  fact  the  category  containing  an  object  of  a  given  type  is 
usually  defined  in  terms  of  die  notion  of  isomorphism  (and  homomorphism)  for  objects  of  that  type. 
Therefore  category  theory  docs  not  provide  any  satisfying  general  notion  of  isomorphism  for  arbitrary 
mathematical  objects. 

Ihc  notion  of  a  type  used  in  universal  algebra  and  computer  science  provides  another  approach  to  a 
general  definition  of  isomorphism.  An  algebra  is  a  domain  together  widi  some  functions  defined  over  diat 
domain.  In  die  universal  study  of  algebras  each  particular  algebra  has  a  type  (or  signature  or  language)  w  hich 
is  a  set  of  symbols  which  arc  interpreted  by  that  algebra.  For  example  die  type  of  a  group  is  die  single  binary 
function  symbol  °.  There  is  a  natural  definition  of  isomorphism  for  the  algebras  of  a  fixed  type  such  that  two 
algebras  A  and  B  arc  isomorphic  just  in  ease  there  is  a  bijeclion  from  the  domain  of  A  to  the  domain  of  B 
which  maps  A's  interpretation  of  any  symbol  to  B’s  interpretation  of  dial  symbol. 

The  notion  of  type  also  plays  a  critical  role  in  many  modern  computer  languages  [Tennant  81]. 
There  is  one  particular  outlook  on  die  types  of  computer  data  structures  which  provides  a  basis  for  a  notion  of 
isomorphism.  Under  diis  view  a  type  is  a  collection  of  objects  w  hich  can  be  defined  in  a  "natural"  way  from  a 
collection  of  base  types.  For  example  if  A  and  B  are  base  types  dicn  the  set  of  functions  from  A  to  B  is  also  a 
type.  Similarly  die  set  of  pairs  AX  B  of  an  element  of  A  and  an  element  of  B  is  a  type.  As  another  example  let 
the  type  SubSets(AXA)  be  the  collection  of  all  sets  of  ordered  pairs  of  elements  of  die  base  type  A.  More 
simply  SuhSets(AXA)  is  the  type  containing  till  directed  graphs  whose  nodes  tire  members  of  A.  Several 
people  studying  such  data  types  and  have  employed  permutations  of  the  elements  of  base  types  to  define  a 
notion  of  a  "natural"  function  between  types  [Alio  &  Oilman  79J  |l)unlaing  &  Yap  82J.  In  fact  I  hinl.iing  and 
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Yap  implicitly  use  a  notion  of  isomorphism  based  on  permutations  of  the  elements  of  base  types  in  defining 
die  automorphism  group  of  an  arbitral  y  typed  object. 

While  dicrc  arc  strong  similarities  between  symmetric  set  theory  the  above  mentioned  work  on  data 
types  (especially  that  of  Dunlaing  and  Yap)  there  is  also  an  important  difference.  Symmetric  set  theory  bases 
die  notion  of  isomorphism  on  points  rather  than  types.  When  the  notion  of  isomorphism  is  based  on  points 
die  isomorphism  class  of  an  object  is  an  essential  property  of  the  object  and  docs  not  depend  on  viewing  that 
object  as  an  instance  of  some  type  (or  as  a  member  of  some  category).  Thus  it  can  be  argued  that  die  need  for 
types  (or  categories)  in  defining  die  notion  of  isomorphism  is  a  byproduct  of  the  fact  that  ur-clemcnts  were 
left  out  of  set  theory. 
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7.  APPENDIX:  CONSISTENCY  AND  CHARACTERIZATION  THEOREMS 


This  section  contains  some  results  concerning  the  existence  and  nature  of  universes  <U  €>  of 
symmetric  sets.  It  is  shown  that  the  existence  of  a  universe  <U  €>  of  symmetric  sets  is  equivalent  to  the 
existence  of  a  strongly  inaccessible  cardinal.  It  is  also  shown  that  a  universe  <U  O  of  symmetric  sets  is 
completely  characterized  by  its  "height"  and  "width".  These  results  provide  insight  into  two  obvious 
questions.  First,  are  die  axioms  for  symmetric  sets  consistent?  Second,  to  what  extent  arc  the  axioms 
categorical,  i.e.  under  what  conditions  are  two  universes  isomorphic? 


7.1.  C  onsistency  and  Strongly  Inacccsible  Cardinals 


Iherc  does  not  seem  to  be  any  satisfactory  proof  that  there  exists  a  universe  of  symmetric  sets. 
However  a  simple  condition  can  be  given  which  is  equivalent  to  the  consistency  of  die  axioms  of  symmetric 
set  theory.  In  essence  the  consistency  of  the  axioms  depends  purely  on  die  consistency  of  axioms  three 
through  five  (infinity,  power  set.  and  union).  Axioms  three  dirough  five  characterize  what  is  known  as  a 
strongly  inaccessible  cardinal.  I  hus  die  axioms  of  die  theory  of  symmetric  sets  arc  consistent  just  in  case  diere 
exists  a  strongly  inaccessible  cardinal.  This  result  is  of  interest  because  strongly  inaccessible  cardinals  have 
been  studied  in  the  context  of  Zcrmello-Fracnkcl  set  theory  and  it  is  generally  believed  diat  it  is  impossible  to 
prove  that  strongly  inaccessible  cardinals  do  not  exist.  Ihus  there  is  strong  evidence  diat  die  theory  of 
symmetric  sets  will  never  be  proven  inconsistent. 

Definition :  A  set  U  will  be  said  to  have  a  strongly  inaccessible  cardinality  just  in  case  it  meets 

the  following  conditions: 

1)  U  is  uncountably  infinite 

2)  IfC  is  a  subset  ofU  smaller  than  U  then  the  power  set  of  C  is  also  smaller  than  U. 

.1)  For  any  family  F  of  subsets  of  U  if  F  is  smaller  than  U  and  every  member  of  F  is  smaller 

dian  U  then  the  union  of  all  members  of  F  is  smaller  dian  U. 

Hie  main  result  of  diis  section  will  be  proven  in  two  parts.  First  it  will  be  shown  that  any  set  U 
which  is  larger  dian  some  inaccessible  cardinal  can  be  expanded  to  a  model  <U  O  of  axioms  one  dirough  six 
(the  foundation  axiom  is  initially  ignored).  It  will  dicn  be  shown  diat  any  model  <IJ  €>  of  axioms  one 
through  six  contains  a  substructure  <U'  €>  which  also  satisfies  axiom  seven  (foundation).  These  two  results 
will  lead  directly  to  the  main  result  diat  a  set  U  can  be  expanded  to  a  universe  <U  €>  just  in  case  U  is  larger 
dian  some  strongly  inaccessible  cardinal. 

For  any  set  C  let  Sm(C)  be  die  family  of  all  non-empty  subsets  of  C  which  are  smaller  than  C.  flic 
following  lemma  concerning  Sm(C)  is  a  standard  result  of  set  dieory  and  will  be  stated  without  proof: 

Lemma  A.  I:  For  any  infinite  sctC,  Sm(C)  is  the  same  size  as  C. 

The  next  lemma  is  a  direct  predecessor  to  the  main  residt  of  this  section. 
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Lemma  A.2:  Any  set  U  which  is  larger  than  some  strongly  inaccessible  cardinal  can  be 
expanded  to  a  model  <U  €>  of  axioms  one  through  six. 

Proof:  To  ensure  diat  there  w  ill  be  a  large  number  of  points  a  subset  P(U)  of  U  is  chosen  such  that 
both  l’(L)  and  U-P(U)  are  the  same  size  as  U  (this  can  always  be  done  for  any  infinite  set  U).  Since 
U  is  larger  than  some  strongly  inaccessible  cardinal  we  can  choose  some  subset  U'  of  L'  with  a 
strongly  inaccessible  cardinality,  l  et  Sm(U  U’)  denote  the  family  of  all  non-empty  subsets  of  U 
which  are  smaller  than  U'.  Since  since  Sm(U  U')  is  a  subset  of  Sm(U)  lemma  A.l  implies  that 
Sm(l!  U')  can  be  no  larger  than  L'.  On  die  other  hand  Sm(U  U’)  contains  all  die  singleton  subsets 
of  U  and  thus  Sm(U  U  )  is  as  large  as  U.  Tints  Sm(U  U')  has  the  same  cardinality  as  U  and  there  is 
a  bijectioti  f  from  U-P(Lf)  to  Sm(U  U')  such  that  each  clement  x  of  U-P(U)  represents  some 
non-empty  subset  fix)  of  U  which  is  smaller  titan  U'  and  every  such  subset  has  a  unique  such 
representation.  The  relation  €  is  now  defined  such  that  x€y  just  in  case  y  is  in  U-P(U)  and  and  x  is 
in  ffy).  The  resulting  stmeture  <U  €>  clearly  satisfies  axioms  one  and  two  (cxtensionalily  and 
strong  comprehension).  A  subset  of  U  is  small  in  <U  €>  just  in  case  it  is  smaller  dian  U’.  Hie 
definition  of  a  set  with  strongly  inaccessible  cardinality  now  directly  implies  diat  <U  €>  satisfies 
axioms  three  dirough  five  (infinity,  power  set,  and  union).  The  fact  diat  P(U)  is  as  big  as  U  implies 
diat  <U  €>  sadsfics  axiom  six. 

It  can  also  be  shown  that  any  model  of  axioms  one  through  six  can  be  used  to  generate  a  model  which  also 
satisfies  foundation. 

Lemma  A.3:  Any  model  <U  €>  of  axioms  one  through  six  contains  a  substructure  which  is  a 
model  of  axioms  one  through  seven. 

Proof.  An  element  x  of  U  will  be  called  a  well  founded  clement  if  there  arc  no  infinitely 
decreasing  €  chains  containing  it.  Let  U'  be  the  subset  of  U  consisting  of  the  well  founded 
elements  of  U.  The  substructure  <U'  O  clearly  satisfies  die  foundation  axiom  so  it  is  sufficient  to 
show  that  it  also  satisfies  axioms  one  dirough  six.  Note  that  every  point  is  well  founded  so  P(U)  is 
contained  in  U'  and  dius  by  the  point  comprehension  axiom  U’  is  a  large  subset  of  U.  If  x  is  a  well 
founded  element  of  U  dial  every  y  such  dial  y€x  is  also  well  founded  and  thus  any  well  founded 
element  represents  die  same  subset  of  U  whether  it  is  viewed  as  an  clement  of  <U  €>  or  as  an 
clement  of  <U'0.  Since  no  two  elements  of  U  represent  die  same  set  under  <U  €>  no  two 
elements  of  U’  represent  the  same  set  under  <U’  €>  and  so  <U'  €>  satisfies  cxtensionalily.  To  show 
that  <U'  €>  satisfies  the  strong  comprehension  axiom  let  C  be  any  subset  of  U'  which  is  small  with 
respect  to  <U  C>,  i.c.  there  is  an  x  in  U  which  represents  C.  Since  every  member  of  C  is  well 
founded  x  must  also  be  well  founded  and  thus  x  is  in  U'  and  thus  C  is  represented  in  <U’  €>.  Thus 
every  subset  of  U'  which  is  small  with  respect  to  <U  £>  is  represented  in  <U’  £>.  On  the  other 
hand  no  set  which  is  large  in  <U  O  can  be  represented  in  <U’  €>.  Thus  a  subset  of  U’  is 
represented  in  <U’  €>  just  in  case  it  is  small  with  respect  to  <U  €>.  The  fact  that  U’  is  large  and  diat 
a  subset  of  U‘  is  small  in  <U'  €>  just  in  case  it  is  small  in  <U  £>  immediately  implies  diat  <U'  O 
satisfies  the  axioms  of  infinity,  power  set  and  union.  Since  U’  contains  P(U)  die  number  of  points 
in  U’  is  large  so  <U‘  €>  also  satisfies  the  point  comprehension  axiom. 

I  emmas  A.2  and  A.3  lead  directly  to  the  main  result  of  this  section. 

Theorem  A.4:  A  set  U  can  be  expanded  to  a  model  <U  €>  of  the  axioms  of  symmetric  set 
theory  just  in  case  U  is  larger  than  some  strongly  inaccessible  cardinal. 

Proof:  If  U  is  larger  than  some  strongly  inaccessible  cardinal  then  by  lemma  A.2  it  can  be 
expanded  to  a  model  <U  O  of  axioms  one  dirough  six  such  that  l’(U)  is  the  same  size  as  U.  By 
lemma  A.3  there  is  a  substructure  <U’  €>  of  <U  €>  which  contains  l*( U )  and  which  satisfies  all  of 
the  axioms  of  symmetric  set  theory.  But  since  U'  has  the  same  cardinality  as  U  it  is  also  possible  to 
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directly  extend  U  to  a  model  <U  €>  of  all  of  the  axioms. 

On  the  other  hand  if  U  can  be  expanded  to  a  model  <U  €>  of  axioms  two  through  five  (strong 
comprehension,  infinity,  power  set,  and  union)  then  it  is  easily  shown  that  U  must  be  larger  than 
some  strongly  inaccessible  cardinal. 


7.2.  The  Height  and  Width  of  a  Universe 


A  universe  <U  €>  of  symmetric  sets  is  characterized  (up  to  isomorphism)  by  two  "numbers”,  its 
height  and  its  width.  The  height  and  width  of  a  universe  <U  €>  are  defined  as  follows: 

Definition:  The  width  of  a  universe  <U  O  is  defined  to  he  the  cardinality  of  its  set  of  points 
P(U).  A  subset  C  of  U  will  be  said  to  be  minimally  large  in  <U  €>  if  no  subset  of  C  which  is 
smaller  than  C  is  large  in  <U  £>.  The  height  of  a  universe  <U  O  is  defined  to  be  the 
cardinality  of  any  minimally  large  subset  of  U. 

The  following  lemma  can  be  demonstrated  directly  from  the  comprehension  axioms. 

Lemma  A.5:  The  height  of  any  universe  <U  €>  is  a  strongly  inaccessible  cardinal. 

The  constructions  used  in  the  consistency  theorems  of  the  previous  section  show  that  for  any 
strongly  inaccessible  cardinal  there  is  a  universe  with  that  height.  Since  P(U)  is  required  to  be  a  large  set  the 
width  of  a  universe  must  always  be  at  least  as  large  as  its  height.  The  constructions  used  in  the  consistency 
theorems  further  show  that  the  width  of  a  universe  can  be  any  cardinality  larger  than  its  height 

A  universe  can  be  thought  of  as  a  rectangle  which  is  no  higher  than  its  width.  The  points  of  the 
universe  should  be  thought  of  as  lying  along  the  bottom  edge  of  this  rectangle.  The  main  theorem  of  this 
section  can  be  proven  directly. 

Theorem  A.6:  Any  two  universes  of  symmetric  sets  with  the  same  height  and  width  are 
isomorphic. 

Proof.  I.ct  the  universe  be  <U^  €]>  and  <1^  £2^-  Since  both  universes  have  the  same  height  a 

set  is  small  with  respect  to  one  universe  just  in  case  it  is  small  with  respect  to  die  other  universe. 

Since  P(Uj)  and  Pd^)  arc  the  same  size  there  exists  a  bijcction  p  from  P(Uj)  to  Pfl^).  The 

function  p  can  be  extended  to  a  function  a  from  all  of  Uj  into  U2  via  die  following  inductive 

definition: 


o(p)  =  p(p)  for  any  point  p  in  P(Uj) 

o(x)  =  the  representation  for  {<r(y):  y£  jx)  for  x  not  a  point 

The  set  on  the  right  side  of  the  second  equation  is  guaranteed  to  have  a  representation  in  Uj 
because  it  is  no  larger  than  the  set  represented  hy  x.  ft  is  easy  to  show  by  induction  on  x  under 
€  I  v  that  a(y)£2<r(x)  just  in  case  yC  jx.  To  show  that  o  is  a  bijcction  it  is  sufficient  to  construct  an 
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inverse  function  a'*  such  that  for  any  x  in  Uj  a’^(o(x))  equals  x  and  for  any  y  in  U2  0(0  '(y)) 
equals  y.  The  inverse  function  a  ^  is  defined  by  extending  p  ^  from  PfUj)  to  all  of  U  via  a  relation 
analogous  to  that  above.  The  two  conditions  relating  a  and  a  ^  can  then  be  proven  by  induction  on 
and  €2*  respectively. 

There  a  few  other  results  which  help  to  characterize  a  universe  <U  €>.  These  results  will  be  stated 
briefly  without  proof.  First  it  can  be  shown  that  in  general  the  size  of  U  equals  the  size  of  P(U)  (which  is  at 
least  as  large  as  the  height  of  U).  Second  the  notion  of  "rank"  used  in  ZF  set  theory  can  also  be  defined  for 
symmetric  sets.  The  details  of  this  definition  are  not  important  but  one  result  concerning  a  characterization  of 
small  sets  will  be  mentioned.  For  any  subset  C  of  U  let  P(C)  be  the  union  over  x  in  C  of  P(x).  It  turns  out 
that  a  subset  C  of  U  is  small  just  in  case  P(C)  is  small  and  the  rank  of  C  is  less  than  the  height  of  the  universe. 


